A policy gradient method for semi-Markov decision processes with application to call admission control

نویسندگان

  • Sumeetpal S. Singh
  • Vladislav B. Tadic
  • Arnaud Doucet
چکیده

Solving a semi-Markov decision process (SMDP) using value or policy iteration requires precise knowledge of the probabilistic model and suffers from the curse of dimensionality. To overcome these limitations, we present a reinforcement learning approach where one optimizes the SMDP performance criterion with respect to a family of parameterised policies. We propose an online algorithm that simultaneously estimates the gradient of the performance criterion and optimises it using stochastic approximation. We apply our algorithm to call admission control. Index Terms Stochastic processes; Semi-Markov decision process; Policy gradient; Two-time scale; Call admission control. ∗Corresponding author. S. Singh is with the Signal Processing Group, Department of Engineering, Cambridge University, CB2 1PZ Cambridge, UK. Email: [email protected] Tel: +44 1223 332 784 Fax: +44 1223 332 662 V. Tadić is with the Department of Automatic Control and Systems Engineering, The University of Sheffield, S1 3JD Sheffield, UK. Email: [email protected] Tel: +44 114 222 5198 Fax: +44 (0)114 222 5661 A. Doucet is with the Signal Processing Group, Department of Engineering, Cambridge University, CB2 1PZ Cambridge, UK. Email: [email protected] Tel: +44 1223 332 676 Fax: +44 1223 332 662 February 14, 2005 DRAFT

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Call Admission Control in Wireless Ds-cdma Systems Using Reinforcement Learning

THAI) สาขาวิชาวิศวกรรมโทรคมนาคม ลายมือช่ือนักศึกษา ปการศึกษา 2549 ลายมือช่ืออาจารยที่ปรึกษา PITIPONG CHANLOHA : CALL ADMISSION CONTROL IN WIRELESS DS-CDMA SYSTEMS USING REINFORCEMENT LEARNING. THESIS ADVISOR : ASST. PROF. WIPAWEE HATTAGAM, Ph.D. 95 PP. ABSTRACT (ENGLISH) DIRECT-SEQUENTIAL CODE DIVISION MULTIPLE ACCESS (DS-CDMA)/ CALL ADMISSION CONTROL/ REINFORCEMENT LEARNING/ ACTOR-CRITIC REINFO...

متن کامل

Optimal Distributed Call Admission Control for Multimedia Services in Mobile Cellular Network

| There is a growing interest in providing multi-media services in mobile/wireless communication networks. Call admission control (CAC) is a key factor in supporting successfully these services. In this paper, we propose a distributed call admission control policy which is optimal from the standpoint of service providers because it maximizes their revenue. A semi-Markov decision process is empl...

متن کامل

Integrated voice/data call admission control for wireless DS-CDMA systems

This paper addresses the call admission control problem for multiservice wireless code division multiple access (CDMA) cellular systems when the physical layer channel and receiver structure at the base station are taken into account. The call admission problem is formulated as a semi-Markov decision process with constraints on the blocking probabilities and signal-to-interference ratio (SIR). ...

متن کامل

Call Admission Control for Multimedia Cellular Networks Using Neuro-dynamic Programming

We consider, in this paper, the call admission control (CAC) problem in a multimedia cellular network that handles several classes of traffic with different resource requirements. The problem is formulated as a Semi-Markov Decision Process (SMDP) problem. It is too complex to allow for an exact solution for this problem, so, we use a real-time neuro -dynamic programming (NDP) [Reinforcement Lea...

متن کامل

New Channel Assignments in Cellular Networks: A Reinforcement Learning Solution

The optimization of channel assignment in cellular networks is a very complex optimization problem and it becomes more difficult when the network handles different classes of traffic. The objective is that channel utility be maximized so as to maximize service in a stochastic caller environment. We address in this paper, the dynamic channel assignment (DCA) combined with call admission control ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • European Journal of Operational Research

دوره 178  شماره 

صفحات  -

تاریخ انتشار 2007